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ABSTRACT 

We  study  the  problem  of  selecting  the  highest  probability  of  success  from  among 
several  negative  binomial  distributions  via  the  nonparametric  empirical  Bayes  approach. 
A  monotone  selection  rule  is  proposed  on  basis  of  monotone  empirical  Bayes  estimators 
of  the  negative  binomial  success  probabilities  which  are  obtained  by  using  the  antitonic 
and  isotonic  regression  methods.  The  asymptotic  optimality  property  of  the  proposed 
empirical  Bayes  selection  rule  is  also  established. 


Key  Words  and  Phrases:  Asymptotically  optimal;  empirical  Bayes;  monotone  estimator; 
monotone  selection  rule;  isotonic  and  antitonic  regression. 
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1.  INTRODUCTION 


In  many  situations,  an  experimenter  is  often  confronted  with  choosing  a  model  which 
is  the  best  in  some  sense  among  those  under  study.  For  example,  consider  k  different 
competing  drugs  for  a  certain  ailment.  One  would  like  to  select  the  best  among  them  in 
the  sense  that  it  has  the  highest  probability  of  success  (cure  of  the  ailment).  This  kind  of 
selection  problem  occurs  in  many  fields,  such  as  medicine,  engineering,  and  sociology.  The 
reader  is  referred  to  Gupta  and  Panchapakesan  (1979)  for  further  discussions  on  goals  and 
procedures  for  this  selection  problem. 

Now,  consider  a  situation  in  which  one  will  be  repeatedly  dealing  with  the  same  selec¬ 
tion  problem  independently.  In  such  instances,  it  is  reasonable  to  formulate  the  component 
problem  in  the  sequence  as  a  Bayes  decision  problem  with  respect  to  an  unknown  prior 
distribution  on  the  parameter  space.  One  then  uses  the  accumulated  observations  to  im¬ 
prove  the  decision  rule  at  each  stage.  This  is  the  empirical  Bayes  approach  of  Robbins 
(1956,1964).  During  the  last  thirty  years,  empirical  Bayes  methods  have  been  studied 
extensively.  Many  such  empirical  Bayes  rules  have  been  shown  to  be  asymptotically  op¬ 
timal  in  the  sense  that  the  risk  for  the  nth  component  decision  problem  converges  to  the 
minimum  Bayes  risk  which  would  have  been  obtained  if  the  prior  distribution  was  known 
and  the  Bayes  rule  with  respect  to  this  prior  distribution  was  used. 

Empirical  Bayes  rules  have  been  derived  for  subset  selection  goals  by  Deely  (1965). 
Recently,  Gupta  and  Liang  (1986, 1988a, 1988b)  have  studied  empirical  Bayes  rules  for 
selecting  binomial  and  negative  binomial  populations  better  than  a  standard  or  a  control 
and  for  selecting  the  best  among  several  binomial  populations.  They  have  assumed  that 
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the  form  of  the  prior  distributions  are  completely  unknown.  Hence,  those  approaches 
are  referred  to  as  nonparametric  empirical  Bayes.  Gupta  and  Liang  (1989a, 1989b)  have 
also  studied  some  other  empirical  Bayes  selection  rules,  in  which  they  assumed  that  the 
form  of  the  prior  distribution  is  known  but  the  distributions  depend  on  certain  unknown 
hyperparameters.  Such  approach  is  therefore  referred  to  as  parametric  empirical  Bayes. 

In  this  paper,  we  are  concerned  with  the  problem  of  selecting  the  highest  probabil¬ 
ity  of  success  among  several  negative  binomial  distributions  through  the  nonparametric 
empirical  Bayes  approach.  The  selection  rule  is  based  on  monotone  empirical  Bayes  es¬ 
timators  of  the  negative  binomial  success  probabilities.  The  framework  of  the  empirical 
Bayes  selection  problem  is  formulated  in  Section  2.  Using  the  isotonic  regression  method, 
monotone  empirical  Bayes  estimators  as  well  as  a  monotone  empirical  Bayes  selection  rule 
are  proposed  in  Section  3.  Finally,  the  asymptotic  optimality  property  of  the  proposed 
empirical  Bayes  selection  rule  is  studied  in  Section  4. 

2.  FORMULATION  OF  THE  EMPIRICAL  BAYES  APPROACH 

Consider  k(>  2)  independent  negative  binomial  populations  ?ri ,  - . . ,  tt*  .  For  each 
i  =  1, . . . ,  k,  let  pi  denote  the  probability  of  success  for  each  trial  in  it i  and  let  AT,  denote 
the  number  of  successes  before  attaining  the  rth  failure  in  7 rt.  We  assume  that  for  each 
i  =  1  ,...,&,  the  trials  in  tt*  are  mutually  independent.  Thus,  conditional  on  p,-,  Xi\p%  has 
a  negative  binomial  distribution  with  probability  function  /»(x|p,),  where 

fi{x\pi)  =  +  ^PfU  -P»)r»  x  =  0,1,2,....  (2.1) 

k 

Let  /(x|p)  =  []  /*(x*IpO>  where  x  =  (x!,. ..,**)  and  p  =  (pi,...,p*).  For  each  p,  let 

«=i 
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P[i]  <  ...  <  p\k\  denote  the  ordered  values  of  the  parameters  pi,...,pjfc.  It  is  assumed 
that  the  exact  pairing  between  the  ordered  and  the  unordered  parameters  is  unknown.  A 
population  7 r,-  with  p,-  =  pj*]  is  referred  to  as  a  best  population.  Our  goal  is  to  derive 
empirical  Bayes  rules  to  select  the  best  population.  The  empirical  Bayes  framework  of  the 
selection  problem  is  formulated  as  follows. 

Let  fl  =  {p  =  (pi, . . .  ,Pfe)|pi  €  (0,1),  i  =  1,  ...,&}  be  the  parameter  space  and 
k 

let  G(p)  =  17  <j,(p,)  be  the  prior  distribution  over  H,  where  G,(*)  are  unknown  for  all 

»=i 

i  =  l,...,/:.  Note  that  under  this  model,  p,’s  are  assumed  to  be  independently  distributed. 
Let  A  =  {* |t  =  1,...,/:}  be  the  action  space.  When  action  t  is  taken,  it  means  that 
population  rr,  is  selected  as  the  best  population.  For  the  parameter  p  and  the  action  t,  the 
loss  function  L(p,  t)  is  defined  as: 


L(p,i)  =  p[k]  -  pi,  (2.2) 

the  difference  between  the  best  and  the  selected  population. 

Let  X  be  the  sample  space  generated  by  X  =  (Xj,...,Xfc).  A  selection  rule  d  = 

(di,...,dfc)  is  defined  to  be  a  mapping  from  the  sample  space  X  to  [0, l]fc  such  that  for 

each  observation  x  =  (xi,...,x*),  the  function  d(x)  =  (^(x), . . .  ,djt(x))  satisfies  that 

k 

0  <  d{(x)  <  1  for  all  t  =  1 ,...,/:,  and  Y2  d,(x)  =  1.  Note  that  d, (x)  is  the  probability  of 

i=l 

selecting  population  tt,  as  the  best  population  when  x  is  observed.  Let  D  be  the  set  of  all 

selection  rules  defined  previously.  For  each  de  D,  let  r(G ,  d)  denote  the  associated  Bayes 

risk.  Then,  r(G)  =  inf  r(G,  d)  is  the  minimum  Bayes  risk  among  the  class  D,  and  a  rule, 
d£D 

say  do,  is  called  a  Bayes  selection  rule  if  r(G,da)  =  r(G). 
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Based  on  the  preceding  statistical  model,  the  Bayes  risk  associated  with  the  selection 


rule  d  is: 


r  _  ^ 

{G,d)=  /  I^£(P>*K(x)/(x|p)dG(p) 

Jn  sex  »= 1 


-c-E 

*ex 


^<*»(X)<P»(X») 


L»= l 


(2.3) 


/U) 


where 


/(*) 

fi(xi) 

a(xt) 

<P«(X«) 

C 


—  n  /»(x«)> 

»=i 

=  //»(x*|p)<*Gt(p)  =  (Xi£fx)  /px,(l  -  p)TdGi{p )  =  a(xt)/i,(x,), 
o  o 

=  (*‘r+-7l)»  Mx<)  =  /pEi(l  -p)rdG,(p), 

0 

=  ht(xt-  +  1  )/hi(xi)  (note  that  0  <  ^?t(x,)  <  1),  and 

=  £  /p[*]/(x|p)<*G(p). 

*€X  n 


(2.4) 


Note  that  C  is  a  constant  which  is  independent  of  the  selection  rule  d.  Thus,  from 
(2.3),  Bayes  selection  rules  can  be  obtained  as  follows. 

For  each  x  €  X ,  let 


A{z)  =  (:V<(xi)  =  max  <pj(xy)}.  (2.5) 

t<j<k 

Any  selection  rule  d  =  (di,...,dk)  such  that  £  ^t(s)  =  1  is  always  a  Bayes  rule.  Thus, 

*€A(i) 

a  randomized  Bayes  rule  is  da  =  (dic,...,dfcG)  where 


if  t  €  A(x), 
otherwise, 


(2.6) 


and  |Aj  denotes  the  cardinality  of  the  set  A. 


Since  the  prior  distribution  G  is  unknown,  it  is  not  possible  to  apply  the  Bayes  rules 
for  the  selection  problem  at  hand.  In  this  case,  we  use  the  empirical  Bayes  approach.  It  is 
assumed  that  past  observations  from  each  of  the  k  populations  are  available. 

For  each  *  =  1, . . . ,  k,  let  (X,y,  *y).  i  =  i.-  ,  n,  be  independent  random  vectors 
associated  with  population  7r,-,  where  P,y  stands  for  the  random  probability  of  success  for 
each  trial  in  7r,  at  stage  j,  and  X,y  stands  for  the  number  of  successes  before  attaining 
the  rth  failure  in  7r,-  at  stage  j.  It  is  assumed  that  Pi:  has  prior  distribution  G,  for  all 
j  =  1,2,....  Conditional  on  P,y  =  p,y ,  Xu  | p,y  has  a  negative  binomial  probability  function 
fi{xij\pij)  given  in  (2.1).  It  should  be  noted  that  Xij  is  observable  but  P,y  is  not.  Let 
the  jth  stage  observations  be  denoted  by  2Cy.  That  is,  2Cy  =  (ATiy, . . . ,  Xkj).  From  the 
assumptions,  Xt ,...,Xn  are  mutually  independent  and  identically  distributed.  We  also 
let  Xn+1  =  X.  =  (-^l , . . . ,  Xk)  denote  the  observation  at  the  current  stage. 

From  (2.5)  and  (2.6),  a  natural  empirical  Bayes  selection  rule  can  be  defined  as  follows: 
For  each  i  =  1, . . . ,  fc,  and  n  =  1,2,...,  based  on  the  past  data  Xt  l , . . . ,  Xtn  and  the  present 
observation  Xi  =  z,-,  let  <p,„(z ,)  =  <£>,n(x,-;  X,i, . . . ,  X,n)  be  an  empirical  Bayes  estimator 
of  Then,  by  letting 

AnU)  =  {t'|Pin(a;,)  =  ma*^yn(xy)},  (2.7) 

an  empirical  Bayes  selection  rule  dn(x)  =  (din(x),. . .  ,dfcn(x))  is  defined  as  below. 

*.(*)-(  (2.8) 
( 0  otherwise. 

Note  that  the  past  data  (2C1} . . .  ,2£n)  is  implicitly  contained  in  the  subscript  n. 
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For  such  an  empirical  Bayes  selection  rule  dn,  let  r(G,  dn)  be  the  corresponding  overall 
Bayes  risk.  That  is, 

r  _  * 

r(G,dn)=E  /  ^L(p,i)din(i)/(x|p)dG(p) 

/nxgr«= i 

=  C  -  /U)> 

*er  L«=i 

where  the  expectation  E  is  taken  with  respect  to  (2£i>  •  •  •  ,2£n)-  Since  r(G)  is  the  minimum 
Bayes  risk,  r(<3,  dn)  —  r(G)  >  0  for  all  n.  Thus,  the  nonnegative  difference  r(G,d„)  -  r(G ) 
is  used  as  a  measure  of  the  optimality  of  the  empirical  Bayes  selection  rule  dn. 

Definition  2.1.  A  sequence  of  empirical  Bayes  selection  rules  {dn}^L1  is  said  to  be  asymp¬ 
totically  optimal  relative  to  the  (unknown)  prior  distribution  G  if  r(G,dn)  —  r(G)  — *  0  as 
n  — +  oo. 

In  the  following,  we  seek  a  sequence  of  asymptotically  optimal  empirical  Bayes  selec¬ 
tion  rules  for  the  selection  problem  under  study. 


(2.9) 


3.  THE  PROPOSED  EMPIRICAL  BAYES  SELECTION  RULE 

Before  we  go  further  to  construct  empirical  Bayes  rules  for  the  selection  problem  at 
hand,  we  first  investigate  some  properties  related  to  the  Bayes  selection  rule  dc  defined  in 
(2.6). 

Definition  3.1.  A  selection  rule  d  =  (di,...fdk)  is  said  to  be  monotone  if  for  each  »  = 
1  di(s)  is  increasing  in  while  all  other  variables  x;  are  fixed,  and  decreasing  in 

Xj,  for  each  j  ^  t  while  all  other  variables  are  fixed. 

Note  that  for  each  i  =  1, ...  ,/s,  £>t(x,)  =  ht(x,  +  l)/h,(x,).  Straight  computations 
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show  that  tpi(x ,)  is  an  increasing  function  of  x,.  Thus,  from  (2.5)  and  (2.6),  one  can  see 
that  the  Bayes  selection  rule  dc  is  a  monotone  selection  rule.  Also,  note  that  pt(x ,)  is  the 
posterior  mean  of  P,  given  X,  =  x,,  and  it  is  the  Bayes  estimator  of  P,  given  X,  =  x,  for 
squared  error  loss.  Under  the  squared  error  loss,  the  problem  of  estimating  the  probability 
of  success  in  a  negative  binomial  distribution  is  a  monotone  estimation  problem.  By 
Theorem  8.7  of  Berger  (1985),  for  a  monotone  estimation  problem,  the  class  of  monotone 
estimators  form  an  essential  complete  class.  Also,  for  the  present  selection  problem,  under 
the  loss  function  given  in  (2.2),  the  problem  is  a  monotone  decision  problem.  Again,  from 
Berger  (1985),  the  class  of  monotone  selection  rules  is  essentially  complete.  Now,  one  can 
see  that  if  the  empirical  Bayes  estimators  £>i„(xt),  i  =  1  are  monotone,  then  the 

empirical  Bayes  selection  rule  given  through  (2.7)  and  (2.8)  is  also  monotone.  From  these 
considerations,  it  is  reasonable  to  desire  that  the  concerned  estimators  {<£>,■„ (xt)}  possess 
the  above-mentioned  monotonicity  property. 

For  each  i  =  1, . . .  ,k,  let  JV,n  =  max(X,i, . . . ,  X,„)  —  1.  For  each  x  =  0, 1, 2, . . .,  let 

fin{%)  =  ~  £ /{.}(*«),  (3-1) 

;=i 

where  I  a  denotes  the  indicator  function  oi  the  set  A.  Let 

hin{x)  =  /,n(x)/o(x).  (3.2) 

It  is  intuitive  to  use  hin(x)  as  an  estimator  of  h,(x).  However,  note  that  /i,(x)  is  nonin¬ 
creasing  in  x;  while  /i,-n(x )  may  not  possess  the  nonincreasing  property.  Thus,  we  consider 
a  smoothed  version  of  &fn(x).  Note  that  hin{x)  =  0  if  x  >  iV,n  +  l.  Thus,  let  {hJn(x)}^L"0+1 
be  the  antitonic  regression  of  {^tn(x)}^i'b4’1  with  equal  weights,  and  let  h*n(y)  =  0  for  all 
y  >  Nin  +  1-  Thus,  htn(x)  is  nonincreasing  in  x. 
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For  x  =  0, 1, ... ,  Nin ,  let 


<Pin(x)  =  h’n[x  +  1  )/h-n{x).  (3.3) 

Since  <Pin{x)  may  not  possess  the  nondecreasing  property  as  £>,(x)  does,  we  consider 
a  smoothed  version  of  <Pin[x)  as  follows.  Let  be  the  isotonic  regression 

of  {¥?m(x)}*=o  with  random  weights  {At?n(x)}£Lv  For  y  >  Nin,  we  define  <p‘n(y)  = 
<p*n(iV,n).  Therefore  <p*n(x)  is  nondecreasing  in  x.  Note  that  0  <  <p*n(x)  <  1  for  all 
x  =  0, 1,2, . . ..  We  use  <p‘n{x )  to  estimate  <Pi(x)  and  propose  an  empirical  Bayes  selection 
rule  d'n  =  (d*ln, . . .  J  as  follows. 

For  each  x  =  (xi,.. .  ,  x*)  €  X,  let 


Kiz)  =  {»!»><«(*»)  = 


(3.4) 


and  for  each  i  =  1, ...» k,  let 


<»(*) 


=  { 


i^;(^)r1  i  f  »€-*;(*), 

0  otherwise. 


(3.5) 


Since  <pfn(xi)  is  nondecreasing  in  for  each  *  =  1  we  see  that  d*(x)  is  a 

monotone  empirical  Bayes  selection  rule. 

Remark  3.1.  For  each  y  =  0,1,2,...,  let  Hin(y)  =  £  /t,„(i),  #t*n(y)  =  E  hin(x)’  and 

1=0  x—0 

#,(y)  =  it  hi(x).  Also,  for  each  y  =  0,l,...,iVt„,  let  ¥in(y)  =  E  <Pin{x)h'in{x),  and 


x=0 


2=0 

*in(y)  =  E  PinWKni*)-  Tiien,  by  the  definition  of  <pin[x),  rpin{y)  =  E  Knix  +  1)  = 

x=0  x=0 

H‘n(y  +  l)  —  H*n( 0)  for  0  <  y  <  iV,n.  Also,  from  Barlow,  et  al.  (1972),  and  by  noting  the 
fact  that  hin(y)  =  h^n(y)  =  0  for  all  y  >  Nin  +  1,  we  have  the  following  results: 


#,*n(y)  >  Hin(y)  for  all  y  =  0, 1,2, . . . 


(3.6) 
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(3.7) 


snV\Hin{y)  -  Hi{y) I  <  supj Hin(y)  -  Ht{y)\. 
y>o  y>o 


*;«(y)  <  *«(»)  for  all  y  =  0, 1, . . . 


Remark  3.2.  From  Puri  and  Singh  (1988),  we  have 


^n(O)  = 

•»*(*)  -  -  ♦;.<*  -  i»/w.(»)  -  - 1))). 

From  (3.8),  (3.9)  and  (3.10)  we  obtain 

>  ,s^, .!(**•(»)  -  «»(*  - 1 Mb;m  -  H-jx  - 1))] 
=  ,<™inKJW»h  + 1)  -  u;n(z))/(HU  y)  -  h;„(  i  - 1))] 

for  each  x  =  0, 1, . .  .,Nin,  where  #,*„(-!)  =  0. 


S-gmark  3.3.  Analogous  to  Puri  and  Singh  (1988),  we  can  obtain  an  alternative 
<Pin{x)  as  follows. 


^in{Nin)  =  max 
0<y<Nin 


'Ni, 


Nin 


L  V=y 


r-y 


Vin{x)  =  nmax 

0<y  <z 


Nii%  \  I 

£ *>»w*:.(r) -  £  awcw  /tew 

r=i  +  l 


v=y 


T—y 


for  x  =  0, 1, ,  ]Vjn  —  1,  and 


^«n  Nin 

X  V?m(r)h*n(r)  <  X  Pin(r)hin(r)>  for  x  =  0, 1,. ..  ,Nin. 


Thus,  we  can  obtain 


V'nix)  <  max 

0<y<  x 


<Pin(r)htn(r)  ~  ^in(r)^fn(r)  ]  /  X!  ^in(r) 


\r—V 


r=x+l 


r—y 


=  og?JW»(*  +  ^  /  (**.(*)  -  5*n(y  -  1))] 


(3.8) 

(3.9) 
.  (3.10) 

(3.11) 

form  of 

(3.12) 

(3.13) 

(3.14) 

(3.15) 
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for  each  x  =  0, 1, . . . ,  iV,n. 


4.  ASYMPTOTIC  OPTIMALITY 

In  this  section,  we  investigate  the  asymptotic  optimality  property  of  the  sequence  of 
empirical  Bayes  selection  rules  {d*}  defined  previously. 

For  1  <  *  <  j  <  k,  and  each  x*  =  0,1,2...,  let  A,y(xt)  =  {xy| v?y(xy)  >  p»(x,-)}  and 

=  {xj\<pj[xj)  <  <Pi{xi)}.  Let 

w.  —  _  /  min  ^»y(*»)  if  #  <t>’ 

\  oo  if  A,y(xi)  =  <f>, 

max  Bij(xi)  if  J3,y(x,)  #  <£, 


.  .  /  mas 

Note  that,  by  the  nondecreasing  property  of  <py(x y),  m,y(xt)  <  Af,y(xt)  and  m,y(xt)  < 
Af,y(xi)  if  Aij(xi)  ^  4>.  Let 

ti  —0  i,  =0 

*•=0  *y=M,y(zj) 

fflij  (*t )  OO 

where  5Z  =  0  if  m*y(x»)  =  —1  and  =  0  if  Af<y(xi)  =  oo. 

*,=0 

From  (2.5)  and  (2.9),  a  straightforward  computation  leads  to  that 

o  <!•(<?,<)  -r(G) 

Jfe—  1  Ac 

*=i y=*+i 

Thus,  in  order  to  investigate  the  asymptotic  behavior  of  the  difference  r(G,  d*)  —  r(G),  it 
suffices  to  consider  the  case  where  k  =  2.  We  claim  that 


Theorem  4.1  r12(n)  ->0asn-*cc. 
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Theorem  4.1  implies  that  the  sequence  of  empirical  Bayes  selection  rules  {<£*}  is 
asymptotically  optimal.  In  the  rest  of  this  section,  we  are  going  to  prove  Theorem  4.1. 
The  following  lemmas  are  useful  in  presenting  a  concise  proof  of  Theorem  4.1. 

Lemma  4.1  Let  {am}  he  a  sequence  of  real  numbers  and  {6m}  be  a  sequence  of  positive 
numbers  such  that  bm  <  1  and  bm  is  nonincreasing  in  m.  Then,  for  each  positive  constant 
e, 


sup 

n>l 


y:  ambm |  >  (>)c  implies  that  sup  |  ^  am\  >  (>)c. 


-  m=  1 


n>  1 


-  m=  1 


Since  this  lemma  is  trivial,  the  proof  is  omitted  here.  The  following  is  a  consequent 
result  of  Lemma  4.1. 

Z 

Corollary  4.1  For  each  t  =  1  and  z  =  0,1,2...,  let  F{(z)  =  £  /i(x)>  Fin{z)  = 

z=0 

z 

fin(x).  Then,  for  fixed  positive  constant  c, 

1=0 


sup|5‘tn(z)  —  Hi(z)\  >  (>)c  implies  that  sup|.Ftn(z)  —  F,(z)|  >  (>)c. 

z> 0  z> 0 


Proof:  There  is  a  direct  result  of  Lemma  4.1  by  noting  that  h,„(x)  —  h,(x)  =  (/,n(x)  - 
/,(x))/a(x),  where  1  /a(x)  =  (x^’l71)  1  —  *  which  is  decreasing  in  x. 

Lemma  4.2.  For  each  fixed  x  =  0,1,2...,  and  0  <  t  <  <Pi{x),  let 

<2»(y|*,0  =  [-^i(y+l)+fi’i(x)]-h[ff,(y)-ffj(x-l)][¥?t  (x)-t]  for  all  y  =  x,x+l,....  Then, 
|x, t)  is  nonincreasing  in  y  and  therefore,  maxQ,(y|x,t)  =  <5,(x|x,t)  =  — i/i<(x)  <  0. 

y>z 
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Proof:  Qi(y\x,  f)  -  Q,(y  +  l|x,£) 

=  [Hi(y  +  2)-  Hi(y  +  1)!  +  [Ht(y)  -  Ht( y  +  l)]b,(x)  -  t\ 

=  hi(y  +  2)  -  hi(y  +  l)[<p,(x)  -  t] 

=  hi{y  +  l)[<Pi(y  +  l)  -  <pi{x)  +  t] 

>  hi(y  +  l)£ 

>  0 

where  the  first  inequality  is  due  to  the  fact  that  y  >  x  and  thus  <pi(y  +  1)  >  <p,(x).  There¬ 
fore,  <Pi{y\x,t)  is  nonincreasing  in  y  for  all  y  >  x,  and  hence  maxQ,(y|x, t)  =  (J,(x|x,  t)  = 

y>x 

-thi(x)  <  0. 

Lemma  4.3.  For  each  x  =  0,1,2,...,  and  0  <  t  <  1  —  <Pi(x),  let 

Ri{y\x,t)  =  \-Hi{x  +  1)  +  JJ,(y)]  +  -  H{{y  -  l)]  [<pi(x)  +  t]  for  all  y  =  0, 1, . . .  ,x, 

where  1)  =  0.  Then,  i2t(y|x,£)  is  nonincreasing  in  y  for  y  =  0, 1, . . .  ,x,  and  hence, 
^min  i2,(y|x,£)  =  i2,(x|x,t)  =  th\(x )  >  0. 

Proof:  For  0  <  y  <  x  —  1, 

Ri(y\x,t)  -iZ,(y  +  l|x,f) 

~  y  +  l)j  +  (#i(y)  -  Hi{y  -  l)j[^(x)  +  t] 

=  -  hi(y  +  1)  +  h,(y)[v5,  (x)  +  f] 

+  <pi{x)  +  t] 

>hi{y)t 

>0 

since  0  <  y  <  x  -  1  and  thus  <p,(x)  —  <Pi(y)  >  0.  Therefore,  i2,(y|x,£)  is  nonincreasing  in 
y  for  y  =  0,1,..., x,  and  hence  min  Ri{y\x,t)  =  J2,(x|x,t)  =  hi(x)t  >  0. 
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Lemma  4.4.  For  each  t  =  1, . . . ,  A:,  t  >  0  and  x  =  0, 1, 2, . . . 

P{<Pin{x)  ~  Viix)  <  “0  <  [^i(x)]n  +  dexj>(—nt2h2(x)/8) 

where  d  Is  a  positive  constant  which  is  independent  of  the  distribution  F,-. 

Proof:  Note  that  0  <  y??n(x),  £>t(x)  <  1.  Hence  F{v??n(x)  —  <Pi(x)  <  —  f}  =  0  if  t  >  <pi(x). 
Thus,  in  the  following,  it  suffices  to  consider  those  t  such  that  0  <  t  <  <p,  (x). 

Now,  for  0  <  t  <  <pi(x), 

P{<Pinix )  -  P»(x)  <  -0 

=P{<Pinix)  -  <Piix)  <  -t  and  Nin  <  x}  +  P{<pt*n(x)  -  <pi[x)  <  -t  and  Nin  >  x}, 
where 

P{<Pinix)  ~  <Pi(x)  <  -t  and  Nin  <x}<  [F,(x)]n  (4.2) 

which  is  obtained  by  the  definition  of  iVin. 

Let  Tin(x)  =  Hin(x)  ~  H'(x)  and  T,n(x)  =  Hin[x)  -  Hi[x).  From  (3.11),  (3.7), 
Corollary  4.1,  Lemma  4.2  and  Lemma  2.1  of  Schuster  (1969)  and  by  noting  the  fact  that 
0  <  t  <  <pi(x)  <  1,  we  have 


P{<p’in(x)  -  Piix)  <  and  Nin  >  2} 

<P{H-n{y  +  1)  -  H-n{ x)  -  [. H-n(y )  -  H-n{x  -  l)][^,(x)  -  t\  <  0  for  some  x  <  y  <  Nin} 
=P{T-n{y  +  1)  -  T-n{x)  -  [TfM  -  T*n (x  -  l)][*?,(x)  -  t]  <  Qi{y\x,t)  for  some  x  <  y  <  Nin) 
<P{T-n{y  +  1)  -  T-n{x)  -  [r;n(y)  -  T-n[x  -  l)][*>,(x)  -t]<  -th^x)  for  some  x  <  y  <  Nin} 
<P{  sup  \T;n{z)\  >  thi{x)/4}  (4.3) 

*<*<Nin  +  l 

<P{sup|27n(z)|  >  thi(x)/4} 

z>  0 

<P{sup|Ti„(z)|  >  thi(x)/4} 

z>  0 

<P{sup|P,n(z)  -  Fi(x)\  >  thi(x)/4} 

z>0 

<dexp(—nt2h2(x)/S). 

Thus,  from  (4.1),  (4.2)  and  (4.3),  the  proof  of  this  lemma  is  complete. 

Lemma  4.5.  For  each  i  =  1, . . . ,  k,  t  >  0  and  x  =  0, 1, 2, . . ., 

P{Pin(x)  ~  <Pi(x)  >  0  <  [Pi (2) ] n  +  dexp(-n*2/i?(x)/8) 

where  d  is  a  positive  constant  which  is  independent  of  the  distribution  P,. 

Proof:  The  proof  of  this  lemma  uses  the  results  of  (3.7),  (3.14),  Corollary  4.1,  Lemma  4.3 
and  Lemma  2.1  of  Schuster  (1969).  The  argument  for  the  proof  of  this  lemma  is  similar 
to  that  of  Lemma  4.4  We  omit  the  detail  here. 
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Proof  of  Theorem  4.1 


Let  A(xi,i2)  =  £>1(2:1)  —  £>2(x2).  Note  that 

00 

rl2{n)  =  ^2  A(xi,X2).P{y?In(xi)  <  £,2n(a:2)}/l  (2:1) /2(2:2) 

Xi=0  12=0 

+  ^2  (-A(xi,X2))P{^Jn(xi)  >  £>2r,(;E2)}/l(2:i)/2(x2)- 

xj=0  xa=Mi3(xi) 


For  X2  <  mi2(xi),  £>i(xi)  -  ^2(2:2)  >  0  and 
P{Vl»(*l)  <  <P2n(I2)} 

f  .  ,  ^  ^  <Pl(xi)  ~  <p2(x2)  } 

<P  j^in^l)  ~  <PlM  ^ - 2 - J 

+  P  j^nta)  ~  £>2(x2)  >  - j - J  ' 

For  x2  >  M12(xi),  <£i(si)  -  £?2(x2)  <  0  and, 

P{v?;n(*l)  >  £>2»(* 3)} 

f  .  ,  ^  ^  ¥>2(12)  -<pi(iiH 

<p  |£>i»(*i)  -^1(2:1)  > - 2 - J 

.  «  f  .  /  x  ,  x  ^  £>2(X2)  -£2l(3:i)  ) 

+  P  “  £>2(^2)  < - £ - |  • 


(4.5) 


Also,  0  <  fcOrgateal  <  min(£>i(x1),  1  -  £22(3:2))  as  x2  <  m12(xi),  and  0  < 
pa(*a)-pi(ztl  <  mjn(v?2(x2),  1  -  £?i(xi))  as  x2  >  Afi2(xi).  Then,  from  (4.4)-(4.6), 
Lemmas  4.4  and  4.5,  one  can  obtain  the  following: 


!•»(»)  <  *W(*i)]  +  E[FJ{X,)\  (4.7) 

OO  OO 

+  d  /1  (®i)/2 (3*2)  [exp(-nA2(xi,x2)/i?(xi)/8)  +exp(-nA2(x1,x2)/i^(x2)/8)] 

x  1  =0  xa=0 

where  each  of  the  three  terms  at  the  right-hand-side  of  (4.7)  tends  to  0  as  n  tends  to 
infinity.  This  implies  that  ri2(n)  — ►  0  as  n  — ►  00.  Hence  the  proof  of  the  theorem  is 
complete. 
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